Bayesian Enhancement of Speech and Audio Signals Which Can Be Modelled as Arma Processes
نویسنده
چکیده
In application areas which involve digitised speech and audio signals, such as coding, digital remastering of old recordings and recognition of speech, it is often desirable to reduce the eeects of noise with the aim of enhancing intelligibility and perceived sound quality. We consider the case where noise sources contain non-Gaussian, impulsive elements superimposed upon a continuous Gaussian background. Such a situation arises in areas such as communications channels, telephony and gramophone recordings where impulsive eeects might be caused by electromagnetic interference (lightning strikes), electrical switching noise or defects in recording media, while electrical circuit noise or the combined eeect of many distant atmospheric events lead to a continuous Gaussian component. In this paper we discuss the background to this type of noise degradation and describe brieey some existing statistical techniques for noise reduction. We propose new methods for enhancement based upon Markov chain Monte Carlo (MCMC) simulation. Signals are modelled as autoregressive moving-average (ARMA), while noise sources are treated as discrete and continuous mixtures of Gaussian distributions. Results are presented for both real and artiicially corrupted data sequences, illustrating the potential of the new methods. pour illustrer les capacit es de ces m ethodes.
منابع مشابه
A Novel Frequency Domain Linearly Constrained Minimum Variance Filter for Speech Enhancement
A reliable speech enhancement method is important for speech applications as a pre-processing step to improve their overall performance. In this paper, we propose a novel frequency domain method for single channel speech enhancement. Conventional frequency domain methods usually neglect the correlation between neighboring time-frequency components of the signals. In the proposed method, we take...
متن کاملA model for an ARMA process split in sub-bands
Multi-rate digital processing [1] is today a well-established topic, extensively applied in communications, image and audio industry and other areas, for signal coding, adaptive or statistical processing etc. A special class of discrete random processes [2] are those obtained by passing white-noise through a linear digital filter—called Moving-Average (MA) for an Finite-Impulse-Response (FIR) f...
متن کاملSpeech Enhancement Through an Optimized Subspace Division Technique
The speech enhancement techniques are often employed to improve the quality and intelligibility of the noisy speech signals. This paper discusses a novel technique for speech enhancement which is based on Singular Value Decomposition. This implementation utilizes a Genetic Algorithm based optimization method for reducing the effects of environmental noises from the singular vectors as well as t...
متن کاملSpeech Enhancement Through an Optimized Subspace Division Technique
The speech enhancement techniques are often employed to improve the quality and intelligibility of the noisy speech signals. This paper discusses a novel technique for speech enhancement which is based on Singular Value Decomposition. This implementation utilizes a Genetic Algorithm based optimization method for reducing the effects of environmental noises from the singular vectors as well as t...
متن کاملA New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)
Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signals maybe classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...
متن کامل